Cyberbullying detection in social media has becomea critical research challengedueto therapidgrowth ofonlinecommunicationplatforms.Traditionalapproachesrelyeitheronrule-basedsystemsordeeplearningmodels, each with inherent limitations such as poor generalization or lack of interpretability. This paper proposes a hybrid ensemble framework that integrates multiple decision engines, including rule-based logic and transformer-based models,combinedwithaconfidence-awareaggregationmechanism.AnovelMulti-EngineCyberbullyingFramework (MECF), along with a Multi-Class Weighted Grading (MCWG) strategy, is introduced to improve detection robustness. The system evaluates predictions from multiple models and aggregates them using confidence-based weighted voting to produce the final classification. Experimental results on a balanced dataset demonstrate that the proposed approach achieves anF1-score of 0.91 and an AUC score of 0.962, outperforming individual models. The results highlight the effectiveness of combining heterogeneous models with confidence-based consensus for robust cyberbullying detection.
Introduction
The paper presents an AI-based system for cyberbullying detection, prevention, and response to address the growing problem of harmful online behavior on social media platforms. Traditional methods such as keyword filtering and rule-based systems are ineffective at detecting contextual, sarcastic, or implicit abusive content, while even single deep learning models often struggle with noisy and diverse social media text.
To overcome these issues, the study proposes a hybrid ensemble framework (MECF) that combines rule-based systems with transformer-based models. It introduces a Multi-Class Weighted Grading (MCWG) mechanism that assigns confidence-based weights to each model’s prediction, allowing more reliable and balanced final decisions.
The system pipeline includes data preprocessing (cleaning text, tokenization, handling imbalance), feature extraction (BERT embeddings, sentiment and linguistic features), and classification using a fine-tuned DeBERTa model along with other engines. A dataset of over 5000 social media samples (Twitter, Reddit, YouTube, Kaggle) is used for training and evaluation.
Results show strong performance, with the proposed hybrid model achieving an AUC of 0.962 and outperforming traditional ML models (SVM, LSTM) and other transformers (BERT, RoBERTa). Overall, the system improves detection accuracy, robustness, and real-time intervention capability through prevention alerts and automated content moderation actions.
Conclusion
This paper presenteda hybrid ensembleframeworkfor cyberbullying detection that integrates rule-based and transformer-based models. The proposed MCWG- based decision mechanism enables effective aggregation of multiple predictions using confidence scores.
Experimental results demonstrate that the hybrid approachimproves detectionperformancecomparedto individual models. The framework provides a balance betweenaccuracyandrobustness,makingitsuitablefor real-world applications.
Future work includes incorporating sarcasm detection, improvingpreprocessingtechniques,andextendingthe model to multilingual datasets.